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MOTION VECTOR SELECTION BASED ON A PREFERRED POINT 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to a method and associated apparatus 
5 for selecting motion vectors for use in a video motion estimator. Such video 
motion estimators are typically used in video processing applications, particularly 
for the purpose of video compression. 

Description of the Related Art 

MPEG (Moving Picture Expert Group) standards are used in many 

10 current and emerging video products. They are implemented in digital television 
set-top boxes, video-telephone, High Definition Television (HDTV) decoders, video 
conferencing and other applications. These applications benefit from video 
compression in that they require less storage capacity for archived video 
information, less bandwidth for the transmission of the video information, or a 

1 5 combination of both. 

Various video compression techniques are used to reduce the 
substantial amount of data involved in encoding video images. One such example 
is a transform coding using a Discrete Cosine Transform (DCT) process that 
reduces spatial correlation. Another example is motion compensation coding that 

20 reduces temporal correlation between two successive frames. 

The motion compensation is used to predict the current frame data 
from previous frame data based on an estimation of motion between the current 
and the previous video frame. The premise of this motion estimation is that in 
most cases, consecutive video frames will be similar except for changes induced 

25 by objects moving within the frames. In the trivial case of zero motion between 
frames, it is easily and efficiently predicted that the current frame is a duplicate of 
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the prediction frame. In such a case, the only information needed to be 
transmitted to the decoder is the syntactic overhead (i.e., the coding scheme used 
to indicate the identity between successive frames) necessary to reconstruct the 
picture from the original reference frame. 
5 When there is motion in the images, the displacement of part of the 

current frame with respect to previous or future frame is estimated. Therefore, one 
or more motion vectors, representing the displacement of certain pixels between 
the previous and the current frames, are determined to represent the motion. 

Motion vectors from a sequence of video frames are usually 

10 determined by a block matching algorithm. The algorithm is arranged to search for 
the best match block of the current frame from the candidate blocks inside a 
search window in the reference frame. The block matching algorithm divides the 
image into square blocks which contain 'N x N' pixels, and then compares each 
block in the current frame (current block) with those within a reduced area of the 

1 5 reference frame (search window) looking for the closest match. 

Absolute error (AE) is the sum of differences in the luminance level of 
all pixels between the current block and reference block. And the smallest AE 
among the calculated AEs is used to determine the motion vector as a measure of 
the displacement of the pixels between the reference block and the candidate 

20 block in the search window. Alternatively, Square Error (SE), the square of the 
sum of differences in the luminance level of all pixels between the current block 
and the reference block, may be used for motion vector determination. 

Within a search window, there may be a plurality of equivalent AEs 
found during block matching. This particular problem is addressed in United State 

25 Patent No. 5,327,232. According to this patent, the AEs are compared and a 
subset, consisting of all minimum AEs equal to the smallest value, is defined. A 
further subset of candidate vectors from the subset of said minimum AEs is then 
derived. Each candidate vector, representing the displacement of the pixels 
between the current block and the search window, represents the corresponding 
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AE equal to the smallest value. A predetermined motion vector, retrieved from a 
storage means, is compared with the subset of said candidate vectors to 
determine the candidate vector closest to the predetermined motion vector. The 
closest candidate vector is defined as the motion vector of the current block and is 
5 then stored in the storage means as the predetermined motion vector for 
subsequent current blocks. 

A norm function with reference to a user defined preferred point can 
be used to select one of the motion vectors, the preferred point having been 
previously described in the search window. According to United State Patent No. 

10 5,717,61 5, the norm function of the said vectors is created according to the 

function: max{|X|,|Y|} or ^(X 2 +Y 2 ) , where X and Y represent the co-ordinates of 
a vector. The norm of the search points are computed and compared when the 
corresponding AE values are equal to the said smallest AE. 

Calculation of the norm function for each position in the search 

1 5 window demands a large amount of computational capacity. The cost of 

implementing -\](X 2 + Y 2 ) for each position is high as a large processing capacity 
must be provided. The accuracy of max{|X|, |Y|} affects the quality of detected 
motion vectors, and hence the decoded image. It is therefore difficult to use 
approximations to reduce the computational overhead without adversely affecting 

20 image quality. 

Typically, multiple systolic arrays (e.g., 4x16x8, or 8x8x8, etc) are 
used for searching a larger search window. The number of norm functions to be 
calculated is proportional to the number of arrays. Furthermore, the number of 
comparators required to compare AEs in parallel with the norms is significantly 

25 increased. For each search position, two comparators are needed for the AE and 
the norm value. Therefore, for a multiple systolic array of 4x16x8, two 
comparators are needed for sequential comparison. If the number of multiple 
systolic arrays is increased to four, 4x2 comparators are required for determination 
of the motion vector. 
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BRIEF SUMMARY OF THE INVENTION 

According to a first aspect of the present invention, there is provided 
a method of selecting a motion vector for use in a motion estimation system, said 
motion vector defining movement of a block of pixels between a search window 
and a reference frame, said method including the steps of: 

• scanning a search window in a fixed order for a suitable match with 
an area of the reference frame such that each search position within 
a current row or column of the window is one pixel away from a 
previous search position; 

• defining a preferred point in the search window; 

• calculating a primary norm function with reference to the preferred 
point for a given position in each row or column; 

• calculating a norm function for each search position based on a 
linear increment of the primary norm function for the respective row 
or column; 

• calculating an Absolute Error (AE) for each search position, based on 
a difference between a given property of the search position and a 
position in the reference window; and 

• selecting a motion vector, the displacement of the motion vector 
being defined in terms of the search position having the smallest AE, 
or if more than one search position has the same smallest AE, the 
search position having the smallest norm function is selected. 
Preferably, the step of calculating the primary norm function for a 

given position includes the step of calculating the square of the distance between 
the preferred point and the current search point. The square of the distance is 
used, as this obviates the need to perform a square root operation, and only 
relative values of the norm function are needed. 

Preferably, the linear increment for each row or column is (1+2X) or 
(1 +2Y) respectively, where X and Y refer to the respective co-ordinates of the 



current search position. In this way, adjacent search positions may be evaluated 
without the need to perform computationally complex, and hence, lengthy, square 
calculations. 

Advantageously, such a linear increment in no way detracts from the 
5 accuracy of the calculation. 

Multiple systolic array arrangements may be used to increase speed 
of calculation. This allows pre-selection of multiple scan positions and reduces 
multiple calculations and comparisons of AE and norm values. 

Preferably, a pre-selected motion vector is determined on the basis 
1 0 of the AE calculations for said plurality of positions, and the motion vector is 
selected on the basis of the AE and norm functions of the pre-selected motion 
vectors. 

Preferably, the calculated AE results for said plurality of positions are 
sorted according to the distance of their row or column position from the preferred 
15 point, and the pre-selected motion vectors are determined on the basis of the 
sorted AE results. 

Preferably, the method may be used within the framework of the 
MPEG standards. In particular, the reference frame may be one of a forward, 
backward, or interpolated frame. 
20 In the foregoing method, a Square Error (SE) calculation may be 

used in place of the Absolute Error (AE) calculation. 

According to a second aspect of the present invention, there is 
provided a motion estimation device for calculating a motion vector defining a 
displacement between a block of pixels in a search window and a block of pixels in 
25 a reference frame, including: 

• means for scanning a search window in a fixed order for a suitable 
match with an area of the reference frame such that each search 
position within a current row or column of the window is one pixel 
away from a previous search position; 
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• means for defining a preferred point in the search window; 

• means for calculating a primary norm function with reference to the 
preferred point for a given position in each row or column; 

• means for calculating a norm function for each search position based 
on a linear increment of the primary norm function for the respective 
row or column; 

• means for calculating an Absolute Error (AE) for each search 
position, based on a difference between a given property of the 
search position and a position in the reference window; and 

• means for selecting a motion vector, the displacement of the motion 
vector being defined on the basis of the search position having the 
smallest AE, or if more than one search position has the same 
smallest AE, means for selecting the search position having the 
smallest norm function. 

A method of selecting a motion vector for use in a motion estimation 
system is provided, the motion vector defining movement of a block of pixels 
between a search window and a reference frame, the method including: scanning 
a search window such that each search position is one pixel away from a previous 
search position; calculating a primary norm function with reference to a preferred 
point for a given position; calculating a norm function for each search position 
based on a linear increment of the primary norm function for a respective row or 
column; calculating an Absolute Error (AE) for each search position; and selecting 
a motion vector having a displacement defined in terms of the search position 
having the smallest AE and the smallest norm function. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram of a search window showing the rows of 
search positions and the preferred point; 
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Figure 2A is a block diagram showing the scanning order of a search 
process in a search window; 

Figure 2B is a block diagram showing the distance between two co- 
ordinates in a search window; 
5 Figure 3 is a block diagram illustrating an arrangement for performing 

motion estimation for every search position; 

Figure 4 is a block diagram illustrating an arrangement for 
determining the motion vector in accordance with a reordering scheme; and 

Figure 5 is a block diagram illustrating an arrangement for 
10 determining the motion vector in accordance with the scheme: Norm i+1 = Nornrij + 
(2X+1). 

DETAILED DESCRIPTION OF THE INVENTION 

Using a fixed scanning order for the search window, e.g., left-to-right, 

top-to-bottom, the distance between the previous search position and the current 
1 5 search position is one pixel horizontally (or vertically) within the current scan 

column (or row). Therefore, it is only necessary to update one co-ordinate to 

minimize calculation of the norm during the scan. 

Furthermore, if Nornij of a current position i is (X 2 +Y 2 ), the Norm j+1 of 

the next scan position i+1 is given by ((X+1) 2 +Y 2 ) or (X 2 +2X+1 +Y 2 ); hence 
20 Normj +1 = Normj + (2X+1). This result means that mathematical square operations 

can be avoided during the scan by utilizing previously stored norm values. 

For the case of a multiple systolic array implementation, or parallel 

scanning of several search positions, the scan positions are aligned to the same 

row and sorted according to the column distance from the preferred point as the 
25 row distances are the same. This will allow pre-selection of the multiple scan 

positions and therefore minimizes multiple calculations of the norm values as well 

as the number of comparisons required. 
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A current block of pixels is selected from the current frame, and a 
larger block of pixels, called a search window, is selected from the reference 
frame. A preferred point, which is defined by a user, is set in the search window. 
The preferred point is a fixed point, which serves as a reference for all norm 
5 calculations. Any point located outside the search window can be truncated to the 
closest point within the search window. Every position in the search window is 
then compared with the current block, the comparison starting from a corner of the 
search window. X 2 and Y 2 for the norm function is calculated and stored with 
reference to the starting scan point and the preferred point. 

10 The scanning process to determine the best match block is 

performed column-by-column first then followed by row-by-row. The Absolute 
Error (AE) between the current block and the candidate block from the search 
window is used to determine and define the best match block during the scanning 
process. During the scanning process, a norm function is calculated for each 

1 5 search position to determine the distance between the starting scan point and the 
preferred point. A norm is calculated for each search position by updating only 
one of either the X 2 term or the Y 2 term depending on whether the search position 
is changing by a column or a row. The updating process may be further simplified 
to the form of a 1±2X or 1±2Y increment as appropriate. 

20 In the scanning process, it would be desirable and convenient to find 

only one minimum AE over the entire search window active with the corresponding 
current block. However, there may be a plurality of equivalent minimum AEs found 
during the process. If there are multiple positions with the same minimum AE, the 
choice of motion vector will be based on the norm function. In particular, the 

25 position with the smallest norm will determine the displacement of the vector. This 
means that the search position where the starting scan point is closer to the 
preferred point will determine and define the displacement of the vector. 

In order to ease the selection of the motion vector, when multiple 
search positions are scanned simultaneously, the resulting AEs and norms are 
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sorted according to the row position with reference to the preferred point. Firstly, 
the selection is made based on the lowest AEs and sort order. Sort order 
represents the selected row position with reference to the preferred point. Then 
the output is compared with the previous results of AEs. When the AE values are 
5 equal, corresponding norm values are compared to determine the motion vector, 
with the smallest norm determining and defining the optimum vector. 

Referring to Figure 1 , there is shown a search window 1 00 in a grid 
format where each grid area is equal to 1 pixel. An example preferred point 1 10 is 
defined for the search window as illustrated in Figure 1 . For a scanning order 

1 0 running from left-to-right and top-to-down, when the scanning of the first row of 
search positions is completed, a new vertical iteration scan is executed, and it 
scans the next row of search positions, starting at the same horizontal position. 
The AEs and the distances with reference to the preferred point of this new row of 
search position are calculated. The vertical iteration is repeated until the bottom of 

1 5 the search window is reached. 

As shown in Figure 2A, the first block to be scanned with the current 
block is located at the top left corner of the search window. The scanning process 
continues with blocks in the same column, from top to bottom. A new horizontal 
iteration will be started at the new horizontal position when a column of vertical 

20 iteration is finished. The vertical and horizontal iteration is repeated until the whole 
search window has been scanned, (x) and (y) co-ordinates therefore continue 
increasing by one pixel during the scan of the search window. The values of xO 
and yO correspond to the preferred point of the search window with respect to the 
position of the current block. This relation is illustrated in Figure 2B. 

25 During the scanning process, the search position which gives the 

smallest value of AE is determined and defined as the best AE of the current 
search mode. When there are equivalent values of AE for other search positions 
in the same search mode, a norm function is used to determine the position having 
the best AE. The norm function is defined as-^X 2 + Y 2 ) , whereby the square root 
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can be removed during computation, as this is common to all calculations, and 
only relative values are required. This norm function represents the distance of the 
starting search position of the current block with reference to the preferred point in 
the search window. The starting search position of the current block is at the top 
5 left comer, therefore the first norm function calculated will be the distance between 
the co-ordinate x and y of top left point of current block with reference to the co- 
ordinate x and y of preferred point. The norm function can be defined as: 

(topleft__x - prefer_x) 2 + (topleft_y - prefer_y) 2 

The scanning order is left-to-right and top-to-down and the search 
1 0 window area is divided into a grid format of pixel by pixel. Therefore, for a new 
vertical iteration, the distance between the previous search position and current 
search position is one pixel vertically, and for a horizontal iteration, it is one pixel 
horizontally. 

As shown in Figure 3, when the scanning process begins at the first 
15 vertical iteration of a column in the search window, co-ordinate x of the top left 
corner of the current block and co-ordinate x of the preferred point are chosen as 
two inputs to a norm calculation circuit. A multiplex function is performed by circuit 

301 and circuit 305. Subsequently the corresponding co-ordinate y of the same 
point and co-ordinate y of the preferred point are chosen as two inputs to the norm 

20 calculation circuit. For this first vertical iteration of the column, the input of co- 
ordinate y from circuit 305 is chosen. Circuit 304 performs this multiplexing 
function and the output of it is loaded into register 303. Circuit 302 performs the 
subtract operation for the two inputs of the chosen co-ordinate x and y from circuit 

302 and circuit 303. The output of the subtraction circuit 302 is input to the square 
25 circuit 306. After the square operation, each X 2 and Y 2 is input to a norm 

calculation circuit 309. The norm calculation circuit includes an adding function 
which is performed to complete the norm function and the X 2 is then loaded for 
subsequent norm calculation. 
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As well as being input to the subtraction circuit 302, the output of 
register 303 is also supplied as an input to circuit 307. This co-ordinate y value is 
incremented by 1 in circuit 307. This new value of co-ordinate y is the co-ordinate 
y of the next new vertical iteration in the same column. For a new vertical iteration, 
5 the value of co-ordinate y increases by one pixel. Register 303 and circuit 307 are 
used for calculating the increasing value of co-ordinate y corresponding to the 
vertical iteration. For a new vertical iteration, multiplexor 304 will choose the 
output signal from circuit 307, while the output signal of circuit 301 is the co- 
ordinate y of the preferred point. These two signals are input to the subtraction 
10 circuit 302 and then are applied to square circuit 306 as well as norm calculator 
309 to produce a norm value for the current new vertical iteration with the X 2 value 
which has already been loaded. This sequence is repeated for every search 
position in the top-to-down scanning process until the bottom of the column is 
reached. 

1 5 When there is a new horizontal iteration for a new column, the 

operation described above is repeated. By fixing the scanning order to top-to- 
down and left-to-right for the whole search window, the norm value is calculated for 
every search position by using one subtractor, one square circuit and 2 adders. 
Moreover, the results are obtained without any approximation and so the accuracy 

20 of the displacement of motion vector is guaranteed. However this design applies 
to a one systolic array implementation, and only one search position is scanned at 
any one time. 

The AE value of every search position is calculated and is used to 
determine the best match block. Two AE values of different search points are 
25 compared to define the best AE of current vertical iteration. Comparator 310 has 2 
inputs of AE value, one of which is the defined smallest AE value. For the 
comparison between these two AE values, the smaller AE value will be loaded into 
register 308 and kept as a defined smallest AE value. This value is used to 
compare with the incoming AE value from different search position. As well as 
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being loaded as the smallest AE value into register 308, it will also be output to the 
next circuit and defined as the minimum AE value in the current vertical iteration. 
The comparison between norms for corresponding AE values is not needed. The 
norm value for the said minimum AE value will be loaded to the circuit 31 1 and 
5 circuit 31 5. 

When the comparator 310 finds that the current search position has 
an equivalent AE value to the previously defined minimum AE value, a comparison 
between two norms is performed. The norm value for the current search point is 
calculated and output from norm calculator 309. This new norm value is compared 

10 with the norm corresponding to the defined minimum AE value which is stored in 
register 311. If the value of the norm of the current search position is smaller than 
the norm of the stored minimum AE, this new value will be loaded to the register 
311 and also input to circuit 315. Circuit 313 will output an enable signal to circuit 
314 for selecting a new minimum AE, and it is thus defined as the best AE for the 

1 5 current vertical iteration. 

In circuit 315, the new defined minimum AE for the current vertical 
iteration is compared with the previous defined minimum AE value. If these two 
values are equal, the corresponding norm value is used to determine the 
displacement of the vector. Circuit 31 5 stores the smallest value of AE for the 

20 current search mode and its corresponding norm value. Comparison between two 
defined minimum AE values continues until the whole scanning process is 
completed. The output of circuit 31 5 is defined as the minimum AE for the current 
search mode and it represents the optimum displacement of vector for the motion 
estimation. 

25 Other embodiments are applicable to multiple systolic array 

implementations, or parallel scanning of several search positions. Comparison of 
the norm value is based on the position of the scan point with reference to the 
preferred point in the search window. The search position which is closest to the 
preferred point is chosen to define the displacement of vector. Therefore, a sorter 
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design is included to simplify the complexity of the circuit and reduce the cost of 
computation. In this embodiment, the search position is aligned to the same row 
(i.e., having the same y co-ordinate) and the comparison among the AEs is 
executed simultaneously. 
5 Referring to Figure 4, the AEs of 8 search positions are compared 

simultaneously. The scanning order of the search process is fixed to left-to-right 
and top-to-down, as before. At the first row of search positions for a column of 
vertical iteration, the co-ordinate x of the first search position in the row is chosen 
as an input to the norm calculation circuit. Circuit 401 and 405 take care of this 

1 0 multiplex function where co-ordinate x of the current search position and the 

preferred point are selected. For the co-ordinate x of the current search position, 
circuit 404 outputs this signal and loads it into register 403 for outputting to 
subsequent circuits. The distance between the current search position and the 
previous search position is one pixel; therefore, register 403 and adder 406 are 

1 5 used to increase the value of co-ordinate x by one pixel for subsequent search 
positions in the same row. Circuit 404 chooses the co-ordinate x of the first search 
position of the row , and in conjunction with adder 406, increments this value for 
each subsequent search position. 

Register 403 stores every output signal from circuit 404 and outputs 

20 it to the adder 406 as well as subtraction circuit 402 to perform the calculation: 
(topleft_x - prefer_x). The output from circuit 402 is passed to register 408 and 
square circuit 407. Register 408 stores only the subtraction result of 
(topleft_x - prefer_x) for the first search position, and here defines it as 
(Ao - prefer_x) which represents the column distance with respect to preferred 

25 point. Register 408 discards all other incoming subtraction results once the result 
of the first search position is complete. Meanwhile in square circuit 407, the 
subtraction result from circuit 402 is squared. The output of this circuit 407 is 
loaded into a register chain 409. Register chain 409 stores all the square results 
of 8 search positions and it is locked once all 8 square results are loaded. 
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When (A 0 - prefer_x) is locked in register 408 and all X 2 results for 
the current row of search positions are locked in register chain 409, the norm 
calculation circuit is ready for co-ordinate y computation. By using the same 
circuit, the norm function calculation is performed for co-ordinate y. This simplifies 
5 the circuit design and allows the circuit to be reused. At the first vertical iteration of 
the column, co-ordinate y of the current row of search positions and co-ordinate y 
of the preferred point are chosen as inputs to norm calculation circuit. Circuit 401 
and circuit 405 take care of this data selection. These inputs are used to perform 
the (toplefl_y - prefer_y) operation. The subtraction result is then input to circuit 

1 0 407 to perform a square operation and result of Y 2 is output to circuit 41 5. 

The distance between co-ordinate y of the current vertical iteration 
and the next vertical iteration is horizontally one pixel. Therefore, co-ordinate y for 
a new vertical iteration is increased by one pixel. For a new vertical iteration, 
circuit 406 performs the adding operation for co-ordinate y, previously loaded into 

1 5 register 403. Circuit 404 ensures that when starting from the current vertical 
iteration until the bottom of the column, only the increased result from circuit 406 
will be output to register 403. Register 403 outputs the increased co-ordinate y to 
subtraction circuit 402. Again, the same process is performed to produce the 
square result of Y ? . Y 2 of the current vertical iteration is calculated and meanwhile 

20 corresponding AEs of current row of search positions are input to the circuit for 
reordering and comparison. 

The AEs of the current row of search positions are input to circuit 410 
for reordering. The reorder process is based on the (A 0 - prefer_x) value from 
register 408. There are 8 AE values to compare simultaneously and the order of 

25 the AEs is arranged with respect to co-ordinate x of first search position in the row. 
A 0 is the co-ordinate x of the first search position in the row and prefer_x 
represents the co-ordinate x of preferred point. (A 0 - prefer_x) represents the 
column distance between these two co-ordinates x. If the result of (A 0 - prefer_x) 
is equal to or greater than 0, then the order of AE values is maintained. If the 
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value of (A 0 - prefer_x) is equal to or less than -7, the order of AE values is 
reversed with respect to the original order. If the result of (A 0 - prefer_x) is 
between -1 to -6, then the order of AE values is sorted according to the column 
distance of each search position with reference to preferred point. The search 
5 position with the smallest column distance has its AE placed at the beginning of 
the order. The order is thus proportional to the column distance. 

Before the reordering, the AEs are ordered as the alignment of the 
search position in a row. Each AE is given a number according to the alignment of 
the search position, and the number increases from left to right. When the AEs are 
10 reordered, their representative number is also sorted in the same new order. This 
new order set is output to circuit 41 3. This order set is maintained so that the 
corresponding X 2 value can be traced back to where it is stored in register chain 
409. 

After the reordering, the new order set of AEs is output to circuit 41 1 . 

1 5 This circuit is arranged to compare AEs. The AE with smallest value is chosen 
and defined as the best AE for the current vertical iteration. If there are equivalent 
AEs, the AE which is positioned first in the order set is chosen, since it is closer to 
the preferred point. When the selection of the best AE for the current vertical 
iteration is completed/the selection result is output to circuit 413. By using this 

20 arrangement, the search position closest to the preferred point is selected and 
stored. 

The best AE of the current vertical iteration is compared to the best 
AE of the previous vertical iteration. In this case, the corresponding norm value is 
needed if there is equivalence between two AEs. X 2 values for 8 search positions 
25 are stored in register chain 409, and one of the X 2 values will be chosen 
corresponding to the defined best AE for the current vertical iteration. The 
selection is performed in circuit 414, and this selection is based on a selection 
signal from circuit 41 3. The output of circuit 41 3 is based on the order set and the 
selection result from circuit 41 1 . The selection result from circuit 413 tells the 
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selector 414 which AE is chosen from the reordered AEs. Referring to this 
selection result, the representative number for the chosen AE is obtained. This 
representative number is then output to circuit 414. Based on the representative 
number from selector 413, the corresponding X 2 value is chosen and output to 
5 adder 41 5. 

In circuit 415, X 2 and Y 2 for the corresponding AE value is obtained, 
and the norm value is calculated. When the best AE of the current vertical iteration 
is equal to the best AE of the previous vertical iteration, the calculated norm from 
adder 415 is compared to the previous norm. The AE with the smaller norm value 

10 is chosen and stored in circuit 412. Circuit 412 keeps the smallest AE value and 
its corresponding norm, and the comparison is repeated for the next AE. When 
the scanning process for the whole search window is completed, the output of 
circuit 412 is defined as the best AE of the current search mode. 

This circuit design uses only one subtractor, one square circuit and 2 

1 5 adders to perform the norm calculation. It allows the comparison of 8 AEs or even 
more to be performed simultaneously. This helps avoid unnecessary 
computations and requires a less complex circuit design. 

If Norrrij of a current position i is (X 2 +Y 2 ), the Norm i+ i of next scan 
position i+1 is given by ((X+1) 2 +Y 2 ) or (X 2 +2X+1+Y 2 ); hence Norm;, = 

20 Nornrii + (2X+1 ). The square operations can be avoided during the scan by using 
previously stored norm values. With a fixed scanning order of the search window, 
for example left-to-right, top-to-down, a circuit is presented to implement 
Norm i+1 = Normi + (2X+1 ). 

The implementation of the function is shown in Figure 5. A row of 

25 search positions in the search is used to do the scanning process. Firstly, the 
norm of the first position in a row, (x 0 2 +yo 2 ) is loaded into a register for the 
subsequent norm calculation. To compute the norm for subsequent search 
positions, the X value is required. In circuit 503, input of co-ordinate x of the first 
search position in a row is chosen, and this input is incremented by 1 in adder 502. 
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The output of adder 502 is passed to subtract circuit 501 as well as loaded into 
register 505. Register 505 is used to store all increased x, and output to selector 
503. When the x value for the next search position is required, selector 503 will 
choose the output of register 505. The selector chooses the output of register 505 
5 until the scanning process comes to the end of the row of search positions. 

The value of prefer_x is input to subtract circuit 501 . The output of 
adder 502 is used to subtract prefer_x, and X value is obtained at this point. The 
result of the subtraction is output to circuit 504. Circuit 504 is a one bit shifter 
which performs the same function as a multiply by 2 circuit. The output of circuit 

1 0 504 is incremented by 1 in adder 506 and the value of (2X+1 ) is obtained. (2X+1 ) 
is added to the norm of the previous search position where has been loaded into 
register 510. Therefore, the norm of the second search position is obtained and 
loaded into register 51 1 . 

From register 51 1 , the output is passed to comparator 514 and 

15 selector 509. For the first search position in a row, selector 509 chooses (x 0 2 +yo 2 ) 
as its output. When the scanning process begins for the second search position, 
selector 509 selects the input from register 51 1 as its output. The norm of the 
current search position is stored in register 510, and used to compute the norm 
function for the search position next to it. The steps mentioned above are 

20 repeated until the scanning process for a row of search positions is complete. 
When a new row of search positions is used for the scanning process, the 
operation of the circuit starts from the beginning, but a new value of (x 0 2 +yo 2 ) is 
required because the vertical position in the search window is changed. The value 
of co-ordinate x stays the same for as long as the row of search positions starts 

25 from the same horizontal position in the search window. The operation is repeated 
until the whole search window has been scanned. The outputs of register 510 and 
51 1 are passed to comparator 514 when the corresponding AEs are compared. In 
comparator 514, the smaller norm is chosen as an output and loaded into register 
513 for subsequent comparisons. 
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The AE of every search position is input to comparator 512 to 
determine the smallest AE. Register 508 is used to store the previous minimum 
AE for subsequent comparisons. If there is no equivalent value between the 
current AE and the previous minimum AE, the minimum AE is output to selector 
5 517, and, by default, selector 517 chooses this minimum AE and outputs it as the 
best AE of the current row. However, when the current AE is equal to the previous 
minimum AE, the comparison of corresponding norms is required to determine the 
best AE. A selection signal is generated by circuit 51 5 to enable selector 51 7 to 
select the AE with the smaller norm as the best AE of the current row. 

10 The best AE of the current row and its corresponding norm are input 

to circuit 516. In circuit 516, a comparison between the best AE of the current row 
and the previous best AE is performed. When there is equivalence between two 
AEs, then comparison of the corresponding norms is required, The AE with the 
smaller norm is chosen as the best AE of the current search mode and it is stored 

15 in circuit 516 as well as its corresponding norm. Implementation of 

Norm j+1 = Normi + (2X+1) in the embodiment described above saves computation 
time and simplifies the design. This is because a square function is not required in 
the hardware design, and the square of X and Y can be performed in software 
before being input to the hardware circuit. This circuit design may be used in an 

20 arrangement for the parallel scanning of several search points when a register 
chain is added between registers 510 and 511. 

The disclosed embodiments of a sorter in a norm calculation circuit 
allow pre-selection of the multiple scan positions and minimizes multiple 
calculations of the norm values as well as the comparisons required. With this 

25 invention, the norm calculation circuit requires one square circuit and one 

comparison. The comparison is only required when there is equivalence of AE 
value. The embodiments of the invention simplify the calculation method but do 
not change the norm function. This maintains a good performance for motion 
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vector detection because the accuracy of the norm value is maintained and the 
accurate displacement of vector is obtained. 

The provision of a sorter in the norm calculation circuit and the 
provision of a circuit that implements Norm i+ i = Norrrii + (2X+1) allows the 
implementation of a multiple systolic array, or parallel scanning of several search 
positions. The searching process is executed simultaneously for the search 
positions aligned in a row. This allows the motion estimation process to be 
completed in a shorter time, as well as reducing the complexity of the circuits 
required to perform the process. 

The present invention includes a novel feature or combination of 
features disclosed herein either explicitly or any generalization thereof regardless 
of whether or not it relates to the claimed invention or mitigates any or all of the 
problems addressed. 
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